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“You pays your money and you takes your choice.” 

— Mark Twain, Huckleberry Finn 


Abstract 

We consider dynamic pricing schemes in online settings 
where selfish agents generate online events. Previous 
work on online mechanisms has dealt almost entirely with 
the goal of maximizing social welfare or revenue in an 
auction settings. This paper deals with quite general set¬ 
tings and minimizing social costs. We show that appro¬ 
priately computed posted prices allow one to achieve es¬ 
sentially the same performance as the best online algo¬ 
rithm. This holds in a wide variety of settings. Unlike on¬ 
line algorithms that learn about the event, and then make 
enforceable decisions, prices are posted without knowing 
the future events or even the current event, and are thus 
inherently dominant strategy incentive compatible. 

In particular we show that one can give efficient 
posted price mechanisms for metrical task systems, some 
instances of the k-server problem, and metrical matching 
problems. We give both deterministic and randomized 
algorithms. Such posted price mechanisms decrease the 
social cost dramatically over selfish behavior where no 
decision incurs a charge. One alluring application of this 
is reducing the social cost of free parking exponentially. 

1 Introduction 

We consider scenarios where agents arrive and make 
decisions which influence the global environment. One 
such setting is that of online auctions, where goods or 
combinations of goods are sold over time. In this paper we 
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consider a more general setting that captures more general 
online social choice issues. 

We suggest using dynamic posted prices that dynam¬ 
ically assign a surcharge to every decision. Under the as¬ 
sumption that agents are rational, this allows one to prove 
approximation bounds with respect to optimal target func¬ 
tion. 

The competitive analysis of online algorithms typi¬ 
cally deals with problems cast as follows: 

• A sequence of events ffi, (72, ■ ■ ■ occurs over time. 

• Subsequent to observing event (7,, a centralized on¬ 
line algorithm takes some decision s, G Si, where 5, 
is the set of possible decisions that can be taken in 
response to event (7,. 

• Centralized online algorithms choose their decisions 
for event i without knowing what the future holds 
so as to maximize (or minimize), some function 
/((7,s) of the sequence of events a — Gi,... ,Gn and 
associated decisions s = si,... ,s„. 

In this paper we reinterpret the framework above by 
having strategic agents initiate events. So rather than an 
online sequence of events we have an online sequence 
of strategic agents,and rather than a centralized online 
algorithms making decisions the strategic agents make the 
decisions themselves. 

The ith agent, has some (private) type t, G 7). Con¬ 
ditioned upon previous events C7i,..., C7j_ i, and previous 
decisions si G 5i,... ,s,_i G 5,-i, an agent type t G T is a 
function t: Si i—t M. An agent of type f G 7] has value f (s) 
for decisions s G Si. Ergo, a rational agent i will choose 
decision 

Si G argmaxf,(s). 

seSi 

We remark that t may take either positive or negative 
values. In the translation of online problems to the 
strategic setting, costs translate to negative value, benefits 



translate to positive values. Note that the target function 
/ remains as defined in the non-strategic online setting. 

To motivate these strategic agents to “do the right 
thing” (from the perspective of the designer), we intro¬ 
duce dynamic posted pricing schemes that set a surcharge 
on the set of possible decisions. A dynamic posted pricing 
scheme P generates a sequence of functions {Pi,P 2 , ...). 
Dynamic posted pricing schemes determine the i’th func¬ 
tion, Pi, before event (7, arrives. 

The function Pi ; 5, i—K sets posted prices (a sur¬ 
charge) for every possible decision in Si. We assume 
quasilinearity, i.e., the utility of agent i of type f € 7) — 
should she decide upon s G Si — is 

ufs) =t{s)-Pi{s). 

Rational agents always seek to maximize their utility, or 
minimize their disutility which is the negation of their 
utility. 

Dynamic posted prices of particular interest are vary¬ 
ing tolls on express lanes and the SFPark system (SF- 
park.org) which sets parking prices in San Francisco as 
a function of parking congestion [22, 6]. Interestingly, 
our dynamic posted pricing scheme for metric matching 
on the line described below can be viewed as a refinement 
of the SFPark system. 

A very simple example for dynamic pricing, where 
prices are set once and for all (all f)’s are the same), is 
the problem of online matching on unweighted bipartite 
graphs [13]. In this problem, there is a stationary right 
side, vertices (agents) on the left arrive online along with 
edges to all adjacent vertices on the right. The target is to 
maximize the number of matches made. 

The KVV algorithm [13] performs a random permu¬ 
tation of the vertices on the right and assigns an incoming 
vertex to the first available vertex on the right. Translat¬ 
ing this into the strategic agent setting, agent types give 
a utility of one for a match with one of the adjacent ver¬ 
tices. Strategic agents can be induced to emulate the KVV 
algorithm by a pricing scheme assigning a surcharge of 
e,2e,... to the vertices on the right in some random or¬ 
der. Note that the surcharge for a vertex never changes so 
there is no “dynamic” aspect to this pricing scheme. 

Posted prices are inherently a dominant strategy in¬ 
centive compatible mechanism. Given a dynamic posted 
pricing scheme P that produces functions {Pi,P 2 ,.. .), and 
a sequence of agent types t — ti,t 2 , - ■ ■ fn, the set of deci¬ 
sion sequences in equilibria is 


eq{P,t) 


s = si, 


Si G argmaxM, (i) 

S^S; 


Because of ties it may be that |eq(P,f)| > 1. 

In the context of mechanism design it is often as¬ 
sumed that if a mechanism is dominant strategy truthful. 


the agents may as well reveal their true type to the mech¬ 
anism. Moreover, the revelation principle states that with¬ 
out loss of generality, one need consider only such direct 
revelation mechanisms. 

In many contexts, revealing the agent type is prob¬ 
lematic or absurd. When going to the supermarket one 
may observe what people are buying, it is hardly plausi¬ 
ble to interview customers at length as to what their true 
preferences are. Moreover, revealing the true agent type 
may be a grave breach of privacy, see [20]. 

One could, in principle, combine a dynamic posted 
pricing scheme yet insist that the agents reveal their true 
type. Were we to allow this, the functions P,- could depend 
on the true types of previous agents; fi,... and 
on previous decisions taken by these agents: si,...,s,_i 
(may be relevant if agents use randomization). 

To avoid the unnatural aspects of type revelation, we 
like to restrict the information available to the dynamic 
pricing scheme. This should be information that is (rel¬ 
atively) easy to collect by simple observation of the con¬ 
sequence of agent decisions without having to understand 
the causes for such decisions. E.g., one can observe what 
customers purchase, but not what they plan to cook for 
dinner or whom they are having over. We would like to 
allow dynamic pricing schemes* to observe (some) con¬ 
sequences of decisions made by (earlier) agents, but not 
necessarily allow access to the true agent types. 

The less one assumes about observable history, the 
more robust the concept. E.g., in the context of dynamic 
posted pricing for parking, it seems reasonable to place 
sensors at parking spaces so as to as to learn what parking 
spaces are free but it seems much less plausible to require 
a host of miniaturized spy drones to follow everyone about 
(so as to learn the true agent type — her true destination). 
See discussion and definitions in Section 6. 

In the context of strategic agents, we use the term 
price of anarchy [17] instead of the term competitive ratio. 
I.e., under the assumption that agents are rational, the 
price of anarchy is the ratio between the value of the 
target function / when making the best possible decisions 
and the value of / when decisions are the worst case 
equilibria. 

We distinguish between benefit problems, where the 
goal is to maximize /, and cost problems where the 
goal is to minimize /. In particular, social welfare 
maximization is to maximize the function ti{si), and 
social cost minimization is to minimize the social cost 
function — l1=i tii^i). Clearly the goals are equivalent. 

Given a dynamic pricing scheme P, target function /, 
the price of anarchy (for benefit problems) is 


( 1 . 1 ) 


mm 




o,seeq{P,<r) /(cJ,Opt(<7)) ' 


^ Or mechanisms in general. 






where 


opt(ff) G argmax f{a,s). 

s\Sj€Si for all i 

Given a dynamic pricing scheme P, and target func¬ 
tion /, the price of anarchy (for cost problems) is 


iiiaA. " ■ " 'j 

a,Xeq(P.a) /(a,opt(c7)) 

where 

opt(ff) G argmin f{a,s). 

s\Si€Si for all i 

It is common to consider the competitive ratio in 
terms of an adversary, and one can do the same in the 
context of the price of anarchy. With this interpretation, 
one may assume that the adversary knows the true types 
of the entire sequence in advance. 

To clarify the differences between online algorithms, 
online mechanisms, and online posted prices (a special 
case of online mechanisms) the following summary may 
be helpful; 

1. Online algorithms: events appear and are dealt with 
in a centralized fashion. It is assumed that events 
are “true”. Online algorithms enforce their decisions. 
There is no sense in which an event says “I don’t like 
what you’ve decided in my case.” 

2. Online mechanisms; events appear and decisions are 
made, prizes and fees set, in a centralized fashion. 
Just as with competitive algorithms, online mech¬ 
anisms can (magically?) enforce their decisions. 
There is no sense in which an event can “revolt” 
against a decision that she does not like. 

3. Dynamic posted pricing (this paper) sets a posted 
price for every possible decision. Incoming strategic 
events make their own decisions based upon their 
own self interest. “You pays your money and you 
makes your choice”. Moreover, different dynamic 
posted pricing schemes can be compared both with 
respect to the competitive ratio they achieve and with 
respect to the quality of the observable history they 
require. 

1.1 Problems and Results We consider the following 
problems 

Metrical task systems [3] Metrical task systems are 
described by a symmetric non-negative m x m matrix, 
{dst)i<s.t<m, where dss = 0 and the triangle inequality 
holds: dst < d^t + d^t for all 1 < s,f,k < m. The set 
S = {1,... ,m} is the set of states. A task is a vector of 
m non negative real values, 

W= (wi,W2,...,Wm). 


In response to the arrival of a new task, w', a centralized 
online algorithm decides upon some state i G 5. The cost 
of a sequence of decisions for a sequence of 

tasks w*,..., w" is 

Strategic metrical task systems The [dis]utility of an 
agent is the sum of transition cost (to go from state to 
state), the cost of executing the task in the state chosen, 
and the surcharge for the decision taken. It is very easy 
to see that, without introducing a surcharge, the price 
of anarchy is unbounded, strategic agents will never 
take expensive transitions. If one were to consider an 
online algorithm as a mechanism without money, agents 
will lie shamelessly. Every state other than their greedy 
preference would cost infinity. 

Here we give a dynamic posted pricing scheme with 
a competitive ratio of 0{m), within 6>(1) of the best 
deterministic competitive ratio possible. The observable 
history consists of the actual state transitions and the cost 
to serve the tasks in the states chosen. The observable 
history does not include the cost of the tasks in states that 
were not chosen. 

The k-server problem [19,16,15] The k server problem 
has k servers located in some metric space. Online 
requests arrive following which some server is to be 
moved to the request location. I.e., one needs to decide 
upon which server is to be moved. The goal is to minimize 
the sum of distances traveled. A competitive ratio of k is 
known for some metric spaces, whereas 2k — 1 is an upper 
bound for general metric spaces [16], vi. a lower bound 
of k. Closing the gap for deterministic algorithms is an 
infamous open problem. 

The strategic k-server problem The disutility of an 
agent is the distance traversed by the server. Thus, without 
a dynamic posted pricing scheme, agents will always 
choose the closest server to move, giving an unbounded 
price of anarchy. We give a dynamic posted pricing 
scheme for any metric space with 2 servers with a price of 
anarchy 10. For k-servers on the line, we give the optimal 
price of anarchy k. In this case, the observable history 
(the server positions) allows one to infer the true type of 
previous agents. 

Metric Matching [14,12] The metric matching problem 
is described on some metric space. Events are points 
in the metric space, who are to be matched to some 
unmatched point, with the goal of minimizing the sum of 
distances between the incoming points and their match. 
One application of this problem is parking, where one 
seeks to assign homeowners a parking spot close to their 






home. The problem on the line is another infamous open 
problem, conjectured to have a constant competitive ratio 
in [12] but still open. 

Strategic Metric Matching The disutility of an agent is 
the distance between the agent destination and the point 
assigned to the agent. Ergo, strategic agents will always 
choose the closest possible point. 

It follows from [14] that without surcharges the price 
of anarchy may be exponential in the size of the metric 
space. 

We give a randomized dynamic posted pricing 
scheme with a competitive ratio logarithmic in the ratio 
between the largest and smallest distances between points. 
In particular, this implies an exponential improvement in 
the price of anarchy over the price of anarchy without sur¬ 
charges. Moreover, given an estimate on the cost of the 
optimal solution, we can attain a doubly exponential im¬ 
provement over the price of anarchy without surcharges. 

2 Related Work 

Lavi and Nisan [18] initiated the study of online auctions. 
The consider a model of indivisible goods where bidders 
have a privately known valuation for different quantities 
of the goods. Agents learn their valuation at some time 
and must bid immediately. The auction must decide 
immediately how many units to allocate and at what price. 
They show that the only dominant strategy incentive 
compatible mechanisms are those that offer a price menu 
for different quantities. 

Awerbuch, Azar, and Myerson [1] give a general 
scheme that produces posted prices for general combi¬ 
natorial auctions, with a competitive ratio equal to the 
logarithm of the ratio between highest and lowest prices, 
times the underlying competitive ratio for the combinato¬ 
rial auction (which is terrible, in general). 

In [7] Friedman and Parkes introduced a model of on¬ 
line mechanism design, they seek strategyproof mecha¬ 
nisms where agents may manipulate their arrival time and 
values. They give a variant of the VCG mechanism for 
some such problems. One of the issues from this paper 
and subsequent work has been misrepresentation of arrival 
times (or departure times, or both). 

In the main, we do not consider misrepresentation 
with respect to time or position in the sequence. See 
discussion in Section 6 

In Parkes [2 1 ] a general formalism for dynamic envi¬ 
ronments and online mechanism design is given. In this 
general model a mechanism makes (and enforces) a se¬ 
quence of decisions. The decisions may depend on inter¬ 
action with agents as well as changes to the environment. 
In the online mechanism design model, agent types in¬ 
clude arrival times, departure times, and a valuation func¬ 
tion of the decisions made by the algorithm. 


Unfortunately, the positive results in the general set¬ 
ting of [21] have been limited to so-called “single valued 
preferences” where an agent gets a fixed value r iff one 
of a set of “interesting decisions” in made between her 
arrival and departure. (The positive results also require 
limited misrepresentation of the agent types). 

It is important to note that none of the problems we 
consider are single valued. For task systems, the A:-server 
problem, and the parking problem, different decisions will 
generally give different value to the agent. Also note that 
we do not allow misrepresentation of arrival time, and 
insist that departure time be equal to arrival time. 

There has been a large body of work on online auc¬ 
tions in the setting where the order of events is a uniformly 
random permutation. Hajiaghayi, Kleinberg and Parkes 
[9] discuss limited supply online auctions, and Babaioff, 
Immorlica, Kempe, and Kleinberg [2] introduced the 
knapsack and matroid secretary problems, about which 
there has been much subsequent work. 

3 Pricing States for Metrical Task Systems 

3.1 Problem Definition A metrical task system [3] is 
described by a symmetric non-negative m x m matrix, 
{dst)\<s.t<m, where d^s = 0 and the triangle inequality 
holds: dst < d^k + d^t for all 1 < s,t,k < m. The set 
5 = {1,..., m} is the set of states. 

The strategic metrical task system is a variant of 
the metrical task system where tasks are associated with 
agents. Agents arrive online, the /th agent to arrive has 
some task to perform, represented by a vector of m non 
negative real values, 

w' = (w'i,W2,...,w;„) . 

The task w‘ is private to agent i. 

The initial state of the system is sq, the state of the 
system may change over time. Fet be the state of 
the system upon the arrival of agent i. The /th agent to 
arrive may decide to change the state of the system upon 
her arrival to some other state (i.e., Si ^ s,-i) or not (i.e. 
s/ = s,_l). 

Given states s, t, and a task w = (wi,W 2 ,... ,Wm), 
define 

C{s,t,w) = Wi+dst, 

i.e. the cost to switch states from s to f and process task w 
in state f. 

A dynamic pricing scheme for strategic metrical task 
systems sets prices (surcharges) on the states of the sys¬ 
tem. Formally, a dynamic pricing scheme F — (Fi,F 2 ,...) 
is a sequence of price functions Fi,F 2 ,. .., where the /th 
function 

((u, w] J , (i2, ,..., 1, ^ X 5 K+. 

The information available to the pricing function P,- is 
the state sj chosen made by agent j and the work 


wi- expended on the y'th agent’s task w-' in that state, 
for all j < i. This is captured by the sequence 
^ (s 1 , ), (s 2 , ,..., 1 , w[-\ ) ) ■ The prices com¬ 

puted by pricing function Pi are relevant for agent i. 

For notational convenience, we use F, as a pricing 
function over the set of states after i — 1 tasks are pro¬ 
cessed, where the “history” of the system is implicit. We 
stress that the pricing is set without any information re¬ 
garding future events. 

Note that pricing function Pi cannot depend on tasks 
w-', j > i, (this is distinct from an online algorithm that 
may depend on w'). Moreover, it knows very little about 
It only knows the decisions made by 
the agents and the actual work done, i.e., sj and wij for 
1 In particular, this means that the pricing 

scheme does not know the optimal cost to process tasks 
w*,..., w'^ * and cannot compute the work function for a 
state. 

The goal of a strategic agent is to to minimize her 
disutility. This is the sum of the following components: 

1. The work required to do the task in the state selected 
by the agent. For agent i this is . 

2. The transition cost to change states. For agent i this 
is dsj_,Si- 

3. The surcharge associated with choosing (or remain¬ 
ing in) some state, this is determined by the pricing 
scheme (see below). For agent i this is P,(s,). 

Therefore, the agent will choose: 

Si e argmin{w^-f£/i._ji-ff)(i)} 
s^Sj 

As the goal in metrical task systems is cost minimization, 
the target function / is the social cost, 

1^1 

/(^a) = ^ , 

i=l 

i.e., the sum of all state transitions and task processing 
costs. The price of anarchy is as dehned in Equation (1.2). 

3.2 The Fractional Traversal Algorithm for Metrical 
Task Systems We now describe the fractional traversal 
algorithm for metrical task systems from [3]. The frac¬ 
tional traversal algorithm solves a relaxation of the origi¬ 
nal problem in that it allows states to perform fractions of 
a task, possibly switching states repeatedly until the task 
is completed. 

A traversal sequence, T = Ti, T 2 ,..., is an inhnite se¬ 
quence of states of the task system, where every state 
of the task system may appear several times in the se¬ 
quence. The fractional traversal algorithm given here as 


Algorithm 1 is very simple, it executes a fraction of task 
until the work done in the current state, since moving to 
this state, equals the distance to move to the next state in 
the sequence. 

Borodin et al. showed the following with regard to 
the traversal algorithm: 

Theorem 3.1. (Borodin ET AL. [3]) • There 

exists a traversal sequence such that the fractional 
traversal algorithm has a competitive ratio of 
8{m— 1). (Versus the optimal competitive ratio of 
2m — 1 achieved by the work function algorithm). 

• Given any online algorithm that schedules tasks 
fractionally on multiple states with competitive ratio 
c, there exists another online algorithm that executes 
the task in a single state with competitive ratio < c. 

The following variables are used in the traversal 
algorithm (Algorithm 1): 

• The current position in the traversal sequence T is 
indicated by the variable j. 

• Task w' is (fractionally) executed in states 

■ ■ ■ ,'^tr 

1. e., during the processing of task w' the index j takes 
values f,_i,. 

• The fraction of task w' that is executed in state Tj is 

denoted by Xj. Note that = 0 for alH ^ 1 ,..., ti. 

Note too that there may be fG f,_ 1 ,...,f,-, — > 

2, such that = %. 

• The variable pj when dealing with task i represents 
the work expended thus far in the y'th position of T. 

pj= I 

An illustration of the execution of Algorithm 1 is 
given in Appendix A. 1. 

Given a traversal sequence T — Ti,T 2 ,..., dehne the 
traversal distance between indices £ and £' to be 

max{e/)-l 

Se,e' = ■ 

Note that 

• The total work done by the fractional traversal algo¬ 
rithm after completing n tasks is exactly + Pt,,. 
This follows as the algorithm advances to the next 
state in the traversal sequence when the amount of 
work done in the current state is equal to the distance 
to the next state. 


Algorithm 1 The Fractional Traversal Algorithm. 

Input: A traversal sequence T, a set of states S, the 
function d, and an online sequence of tasks wi,W 2 ,.... 
For each task i: 
j ^ h'-l 

while K < 1) 


• X'j ^ min 


Pj 1 _3' 


• PjPj+^j'^^Tj 

• if Pj= d-Cj,Xj+i then ; ^ + 1, P; ^ 0 


h' ^ j 


• The total traversal cost to the fractional traversal 
algorithm for n tasks is exactly every transition 
from Tj to Ty+i has transition cost djj+i. 

We now show that a Traversal Algorithm’s perfor¬ 
mance is monotone with respects to the tasks it receives. 


Lemma 3.1. Let X be some traversal sequence, and let 
w = (w*,..., w") and w = (vv*, •. .vv") be two sequences 
of tasks such that for every i G and every s G S, 

we have that w\ < w\. The total work done by the traversal 
algorithm with respect to X and w is at least the amount 
of work done with respect to X and w. 

Proof Let f,- be the position of the traversal algorithm 
after processing task i in the task sequence w, and f, 
the position of the traversal algorithm after processing 
task i in the task sequence w. In addition, let be the 
amount of work done by the algorithm since index f, given 
sequence w, and Pf the corresponding amount of work 
since reaching f, and given task sequence w. 

We show by an inductive argument that for every 
i G {0,...+ p,. > 5fgj. -I- Pi-. This is clearly true 
for i = 0. Let the claim hold before the arrival of the /-th 
task. If the traversal algorithm that processes sequence w 
doesn’t reach f, when processing task w‘, {tj < tj) then 

^to,fi+Ptj < + l 

- ^to,tj+Pti- 

where the 1st derivation above is by the definition of the 
algorithm, the 2nd is by assumption that tj < t i and the 3rd 
is since pj > 0 for all j. 

Lets assume that tj > tj. We’ll show that it cannot be 
that tj > tj, and when fj = tj we show that pt- < Pt ^, thus 
concluding the inductive proof. 


Because w', > for every state s, we have that for 
every ;G ... ,f,-- 1}, 

(3.3) 

where Aj and Aj are the fractions of tasks vv, and w‘ done 
in Xj. This follows from the definition of A in the frac¬ 
tional traversal algorithm and the inductive hypothesis. 
Therefore, 

(3.4) XI < i- t ^ 

j^U-l 

(3.5) < 1 - 'x;' Aj 



Inequality (3.4) follows since the fraction left over cannot 
be more that one minus the fractions already done. In¬ 
equality (3.5) follows from Equation (3.3), and the final 
equality is by definition. 

Therefore, 

Pt, = Aj. • Wi (Xt,) < Aj • w‘x^_ = Pt. < , 

so tj = tj, and we can conclude the induction. 

□ 

Lemma 3.1 immediately implies that the total cost of the 
fractional traversal algorithm given task sequence w is 
smaller than the cost of the fractional traversal algorithm 
given task sequence w. This is since for every 

i G {0,...,n}. 

3.3 ‘Tollow the Traversal” Algorithm We would like 
to come up with a dynamic pricing scheme for metrical 
task systems for which agents driven by self interest 
follow the traversal sequence. 

We now describe an new algorithm which can be 
simulated by a dynamic pricing scheme that encourages 
greedy agents to “follow” the fractional traversal algo¬ 
rithm. The algorithm, for a given metrical task system (a 
set of states S and a distance metric d : S x S i-G K+) and 
traversal sequence x, receives and processes a sequence of 
tasks w*, ... online. 

Roughly speaking, after processing each task, the 
algorithm simulates the traversal algorithm presented in 
Section 3.2. When handling task i, the algorithm views 
all states between its current position in the sequence, and 
the fractional traversal algorithm’s current position in the 
sequence, f,_i, as a superstate where the cost of a task in 
a superstate is the min cost over its constituent states. The 
details of the algorithm are given in Algorithm 2. 

Even though there may be several indices that mini¬ 
mize c, tie breaking can be done arbitrarily, since all the 
claims hold for every index which minimizes c. 






Algorithm 2 “Follow the traversal” algorithm that allows 
going to all the states between its current state and the 
current state of the traversal algorithm “for free”. 

Input: A traversal sequence T, a set of states S, a distance 
metric d and an online sequence of tasks .... 

Let 1 denote the position of the algorithm in the traver¬ 
sal sequence after completing tasks w*,... and f,_i 
be the position of the traversal algorithm after completing 
these tasks (£o = to = 1). 

Given task w': 

1. Let y'niin ^ min(£,_ i, f,_i) 
and y'max ^ max(^,_ i, f,- 1 ) • 

2. Let Dq i {y'min; • • • : 7max} ■ 

{w\. 

3. Define c(;) = | 

I oo 

4. Set£,-^argmin.>^_^__^c(;). 


since Algorithm 2 may use a direct edge from ^ to 
By definition, the difference in the potential after 
processing the /th task is: 

( 3 . 7 ) = 

Let Ij be an indicator as to whether Tj ^ Dq. Al¬ 
gorithm 2 chooses an index that minimizes c, therefore 

■ Ij- Hence, for any j e 

... ,f,} we have -f •/f+i < -f • 

It-. Since Xj is a convex combination we have: 

( 3 . 8 ) ■ li+i < 

H ^ ■ 4) = 

Y, ^‘ij + ^Jmaxji • 4 ^ 

Cf. 

Case 1: < f,_i and £,■ < f,_i 


j&Do 

J ^ 7 max 
Otherwise 


3.4 Proof of Approximation 



i- 


1 tj 

- •- 


Theorem 3 . 2 . For any traversal sequence X, distance 
metric d, and sequence of tasks w*,...,w", let = 

+ d^^ be the total cost of Algorithm 2, and = 

+d^ the total cost of the traversal algorithm for that 
sequence of tasks (the processing cost plus the transition 
cost). Then < 2 • C^. 

Proof. We use a potential argument. The potential func¬ 
tion we use is O = 5^.which is the traversal distance the 
current position of Algorithm 2 and the current position of 
the traversal algorithm. We show that for every incoming 
task i. 


Cf + A^i<2-Cf, 

where Cf^ and Cf are the costs of Algorithm 2 and 
Algorithm 1 respectively when processing the /th task, 
and AO, is the change in potential after processing it. 

Let Ti = {j : f,_i < j < f,} be the set of positions 
that the traversal algorithm visits while processing the i- 
th task, and let AJ be the amount of work done in Xj in 
the traversal while processing the task. Since the traversal 
algorithm processes the whole task, L;g 7 ; ^‘j — 

Note that 

cf = +Y^j' 

jeTi 


In this case, AO = jTherefore, 

Cf 2 + AO ^ ^ w'A + AO^ = 

To T 

< C,^-f5,_,,,<2Cf. 

Case 2: 1 < f,_ 1 and f,_ 1 < f, < f,- 
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2Cj. 
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In this case, AO = —5, , , — de e. ,. Hence, 
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In this case, AO = —d, , ; + 5/ , />. Therefore, 

(3.6) (3.7) (3.8) 

cf+AO < wf+4 _,a.+AO < wf+25 ^,_,a < 2Cf. 
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If 
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Cf+AO wf+4,^,_.+AO 
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(3-7) , 

< ^Te. + ^e„ti 

(3.8) 

< 2Cf. 

Case 7; f,_i < f,_i and if < < f, 


If—1 


i-i — 


In this case, AO = “ 4-i.lf + 4.6 ^ 

-4,1,-1 +4-1.6- Therefore, 


3.5 Pricing Scheme In this section, we define a dy¬ 
namic pricing scheme that observes decisions made by 
greedy agents and sets a surcharge for the next agent to 
arrive. We want to set prices so that states chosen by 
(greedy) agents are the same as the ones chosen by Al¬ 
gorithm 2 (up to tie breaking) given the sequence of tasks 
w. 

The problem is that dynamic pricing schemes, as 
we’ve defined them, can only observe the greedy be¬ 
havior of agents, the state chosen by the agents, and 
the amount of work done in that state while process¬ 
ing the task; Namely, after agent i has selfishly used 
the system, the mechanism observes i,, the state cho¬ 
sen by agent i, and w^., the amount of work done in s, 
while processing the task, but the other coordinates of w‘ 
(w'j,... ..., wj„) are not known to it. 








We resolve this issue by giving the pricing scheme 
a different (imaginary) sequence of tasks w with the 
following consistency properties: 

1. For every i, w'^. is equal to vv^.. 

2. For every i and for every j G S\ {i,}, w‘j > w‘j. 

3. For every i, given the pricing function f) over the 
state of the system after observing events 1 ,..., i — 1 , 
the computed vector vi>' is consistent with the agent 
selfishly choosing to process her task in state s,-; I.e., 
for all j 

(3.9) w‘j + dsi_iJ + PiU) > + dsj_i,Sj + Pi{si) ■ 


Pricing Scheme 1 A dynamic pricing scheme that makes 
incoming agents “follow” Algorithm 2. 

Input: A traversal sequence T, a set of states S, a 
distance metric d and an observed sequence of deci¬ 
sions (i i, w],), (^ 2 , j), made by agents 

1 , 2 ,..., 1 - 1 . 

The variables and f,_i represent two indices in the 
traversal sequence, set after the i— 1th agent’s decision. 
For 1—1 =0 take fo ^ 1 and Iq ^ 1. 

Setting prices for agent 1: 

1. Let jmm ^ min (fi, i) 
and jmax ^ max(£,_ i, i). 

2. Let Dq i {ymin; ■ ■ ■ 5 Jmax } ■ 

3. For a state s let m{s) ^ min{y > jmmK; = ■* *}• 


Note that running Algorithm 2 on the sequence of 
tasks w yields a 2-approximation to the cost of the Frac¬ 
tional Traversal Algorithm on the sequence of tasks w: by 
Lemma 3.1, the cost of the Fractional Traversal Algorithm 
on w is no larger than its cost on w, and by Theorem 3.2, 
Algorithm 2 gives a 2-approximation to the cost of the 
Fractional Traversal Algorithm (on any sequence, in par¬ 
ticular w). Therefore, it suffices to find a pricing scheme 
that makes the agents follow Algorithm 2 on sequence w. 

We show how to find w that satisfies the consistency 
properties in Section 3.6. From now on, we assume such 
can be computed after observing agent I’s greedy 
behavior. We show how to devise a pricing scheme that 
given w, makes incoming agents act as though they are 
being guided by Algorithm 2 given input w. We describe 
how to do this in Pricing Scheme 1 . We note that although 
Pricing Scheme 1 may price states using negative number, 
we can increase all prices by a large enough constant 
without affecting the decisions of the agents. 

Theorem 3.3. Pricing Scheme 1 makes greedy agents 
simulate Algorithm 2 with task sequence w given as input. 


4. DefineflWs 

{ -4-1,^ + Otherwise. 

Determining and f, after observing (i, ,Wj): 

1. Set ^ m(s,). 

2. Compute vi>' using the set of equations (3.10) as 
described in Section 3.6. 

3. Update f, as done by the Fractional Traversal Algo¬ 
rithm (Algorithm 1) using w' as input. 


if mis) G Dq, then 


c{m{s)) = Wg + ds._i^s-ds._,^s 

= c(m(s)). 


where the third inequality is derived from (3.9), and 


Proof. For every state s' G S, let m{s') be as defined in 
Pricing Scheme 1 and let c be as defined in Algorithm 
2. Since m{s') G argmin^.^.^,.; ( 7 ( 7 ), we may assume that 
Algorithm 2 always prefers m{s') amongst the indices for 
which Tj = s'. 

Let s be the state chosen by the agent. If m{s) G Dq, 
then the disutility of the agent is -f j ^-f P, (i) = 
+dsi_^^s — ds._^^s = c{m{s)). If m{s) f. Dq, then now 
the disutility is 

By the way w‘ is computed, we get that if the algo¬ 
rithm would have received w‘ as input, then given s G S 
that was not chosen by agent /, we have that. 


• if m{s) f. Dq, then 

c(m(s)) = w'g + ds,_i,s- ds,_i,s + 

= -f dsi_ ( +Pi{s) 

= c{m{s)), 

where the third inequality is derived from (3.9). 

Therefore, the agent picks a state s such that m{s) 
minimizes c given task w'. Algorithm 2 also chooses 
an index that minimizes c. Since the tie breaking of 
Algorithm 2 can be done arbitrarily, we can assume that 





the algorithm chooses index m{s) as well when given task 

w‘. □ 

Corollary 3.1. Pricing Scheme 1 yields a price of 
anarchy ofl6{m— 1). 

3.6 Computing “Imaginary” Tasks As mentioned in 
Section 3.5, Pricing Scheme 1 cannot observe the real 
sequence of task vectors w. This is problematic, since 
the pricing scheme runs a simulation of the fractional 
traversal algorithm, which takes the task vectors as input. 

To resolve this issue, we need to compute imaginary 
task vectors with the three consistency properties, as 
described in Section 3.5. We are given a pricing function 
Pj : S i—i' K+ that is set on states of the task system after 
the arrival of the i— 1-th agent, a state s, chosen by the i- 
th agent, and the amount of work done in that state while 
processing the tasks (which is exactly ). Let w‘ = 
(w j, W 2 ,..., wj,,) be the current incoming task (which we 
cannot observe). We claim that the vector vi>' defined 
below is coordinate-wise dominated by w‘ and consistent 
with the agent greedily choosing to process the task in i, 
while the states are priced using Pf. 

(3.10) M>y: = max{0, 

Wj. + ,s,- + Pi (si) — j — Pi (j)} 

for all 7 G S. Note that w\ = w\. 

By definition, w‘j > 0, and since the agent chooses 
Si, we have that w‘j + j- -f Pfj) > w‘^. + + 

Pi (s,) , i.e., w) > + Pi (si) - d,._^J - Pf]). 

Therefore, it holds that vvl > w'j. In addition, since 
w) > w',.+ds^_^,si PPi{si)-ds._^J-Pi{i), we have that 
w)Pdsi_ij+Pi{i) > wj,. -f -f as required in 

(3.9). We conclude that we are able to compute a task 
vector which satisfies the desired consistency properties, 
thus enabling us to devise pricing over states that insures 
a nearly optimal price of anarchy. 

4 Pricing Servers on Trees 

The k-server problem, introduced by Manasse et al. [19], 
is defined as follows. Let {V,d) be a fixed metric space. 
There are k servers, initially located in specified points 
5 ’^ = {■s? A 2 >' ■ ■ >■* 4 } ^ ^ j ^ k, s^ GV, that can 

be moved by the algorithm. Moving a server from x to y 
costs d{x,y). The goal is to serve a sequence of arriving 
requests with minimum possible cost. Each request r; is a 
point in V, and to serve it, the algorithm has to move (at 
least) one server to that point. 

In our setting, we assume that the requests are made 
by successive agents. Moreover, we assume that each 
agent is greedy, and will therefore ask that their request 
is served with the server that minimizes the sum of its 


distance to the request and the price we set for the server. 
Therefore, before the arrival of the /th agent, the algorithm 
sets a payment function for the servers: P,: 5'^* 1 —7 M, and 
agent i would choose to move server: 

£i G argmin{d(ri,sy^)+Pi(sy^)}, 

j 

and the next state would be 

5'=5'-'\{4'}uM. 

The social cost is defined as; 

|a| 

!=1 

On a high level, we want to find out if any algorithm 
can be simulated by appropriate pricing, i.e., whether 
given an algorithm A and its current state (locations of 
servers), we can give a pricing of servers that will make 
an arriving greedy agent with a request r, serve r in the 
same way that A would have, wherever the request occurs. 

For some algorithms, designing a payment scheme 
is straightforward. For instance, Irani and Rubinfeld 
proved that the following 2-server algorithm Balance2 is 
10-competitive on any metric space [10]. The algorithm 
keeps track of the total distance that each of the two 
servers has traveled, and serves each request with the 
server that minimizes the sum of its distance to the request 
and half the total distance it has traveled before. Clearly, 
setting the price of each server to the second summand, 
i.e., half the total distance the server has moved already, 
will make the agents follow the algorithm. The only catch 
is that we have no control over tie-breaking since it is done 
by the agents. However, this can only affect the total cost 
by an additive constant. 

In general, however, simulating any algorithm A is 
not as easy. For instance, A could move many servers (in 
a non-lazy fashion) upon a request. This is the case with 
some known algorithms, for example with the Double 
Coverage (DC) algorithm [4] that is k-competitive for the 
line metric. This algorithm has been generalized to all 
tree metrics, retaining the optimal ratio [5]. As an agent 
is only allowed to pick a server that will serve the request, 
such complex algorithms cannot be simulated directly. 

In this work, we restrict our attention to k-server in 
tree metrics, and the optimal algorithm for this setting [5] 
in particular. We observe that the standard transformation 
to make the algorithm lazy, if done carefully, preserves its 
other properties, which we call locality and monotonicity 
(all defined later), and that combined these enable sim¬ 
ulation by a pricing scheme. We remark that, in general, 
such transformation employs keeping track of “virtual po¬ 
sitions” of servers and basing decisions on those. This 
makes designing a pricing scheme to simulate the algo¬ 
rithm challenging, since an agent’s decision is based on 
the “real positions” of the servers (and their prices). 


4.1 Notation Given an algorithm A, let Sa — 

denote its state, i.e., the set of points 
where A’s servers are currently located. We will associate 
the servers with the points they are in. Similarly, we will 
associate a request r with the point in which it occurs. 
Given two points v and w in a tree, we let ^{v,w) denote 
the unique path between v and w. We say that a server 
s S 5/1 is adjacent for a point v if 3^{y, s) H 5/i = {s}, i.e., 
there is no other server from Sa on the path 3^{y,s). 

A deterministic online algorithm is lazy if the only 
moves it makes consist in moving a single server to the 
current request. An algorithm is local if it always serves 
a request with a server that is adjacent to it. Furthermore, 
an algorithm A is monotone if it satisfies the following 
property: given Sa and a request at r, if the algorithm 
would serve a request at r with s G Sa, then it would also 
serve a request at any vertex on the path £P(r,s) with s. 

4.2 Matchings Later, we give a transformation that 
takes an algorithm A and returns an “equivalent” algo¬ 
rithm A' that is lazy. This transformation relies on mini¬ 
mum cost (perfect, bipartite) matching of Sa to Sa', where 
the costs are specified by the metric distance d. Given two 
sets of points X and Y of equal cardinality, we denote any 
such matching by M{X,Y) and its cost hy d{X,Y). 

When the metric space is a line, we associate points 
with real numbers, use inequalities to specify the order 
of points, and note that d{x,y) = |v —y|. Moreover, we 
remark that in such case there exists a perfect matching 
of particular structure. Namely, given X = {xi ,X 2 , - ■■ ,Xk] 
and Y = {yi ,y2) ■ ■ ■ A<r} that are sorted from left to right, 
i.e., xi < X 2 < ■ ■ ■ < Xk and yi < y 2 < • ■ ■ < J/i, we call a 
matching in which Xi is matched to y, for all i a canonical 
matching and denote it ^{X,Y). 

Lemma 4.1. The canonical matching on a line is a min¬ 
imum cost perfect matching. 

Proof. Let M = M{X,Y) be any minimum cost perfect 
matching of X to T. We show how M can be transformed 
to the canonical matching without increasing its cost. As 
long as M is not canonical, hnd the minimum i such that 
Xi is not matched to y,. Then x; is matched to some 
yj and y; is matched to some x/,. Moreover, j > i and 
h > i since i is minimal. We assume wlog that Jc/ < yi 
(if not, swap the sets). This implies that yj > y,- > x,. 
Modify M as follows: match x, with y, and x/, with yj. 
Matching x, to y; decreases the total distance by yj — y,. 
By the triangle inequality, |x/, - y;j < |x/, - y, | -f |y/ - yj | = 
|x/i — y,j -\-yj — yi, i.e., matching x/, to yj increases the 
total distance by at most yj — yi. Hence, the cost does 
not increase throughout the transformation, and is thus 
minimized by the canonical matching .y^{X,Y) we obtain 
at the end. □ 

Given X and Y and a point r GX,we call a matching 


Algorithm 3 The algorithm transformation. 

Input: An algorithm A and an online sequence of requests 
ri,...,r,-. 

Output: A lazy algorithm A'. 

1. Let Sa be the state of A after serving r, and Sa' the 
state of A' right before serving r,; note that r, G Sa. 

2. Find M — M(Sa,Sa') and serve r,- with the server 
matched to it in M. 


of X to T r-local if r is matched to a point in Y that is 
adjacent to it. 

Lemma 4.2. For any two sets of points X and Y suchthat 
IXj = |y| in a tree and a given point r GX, there exists a 
minimum cost perfect matching ofX to Y that is r-local. 

Proof. Let M — M{X ,Y) ht any minimum cost perfect 
matching of to T. If M is not r-local, do the following. 
Let y be the point in Y that r is matched to. Then there 
are other points from Y on the path (y, r); let y' be the 
one that is closest to r (the last one on the path to it). It 
follows that y' is local for r. Let x GX hs the point that 
y' is matched to. Turn M into another perfect matching 
M' by matching r to y' and x to y. Since y' G 3^(y, r), we 
have d{y' ,r) =d (y, r)—d{y',y). Moreover, by the triangle 
inequality d{x,y) < d{x,y') -\-d{y',y). Thus, the cost of 
M' is no larger than that of M, and hence M' is an r-local 
minimum cost perfect matching of X toY. □ 

We denote a (fixed) r-local minimum cost matching 
of X to Y hy Mr{X,Y). In particular, for the line metric, 
we call the r-local matching that is the result of the 
procedure described in the proof of Lemma 4.2 applied 
to .y^{X,Y) an r-canonical matching, and denote it by 
.JfriX,Y). 

4.3 Algorithm transformation Algorithm 3 describes 
how to transform any algorithm A into an equivalent one 
that is lazy and never incurs larger cost than A. Later, 
we observe that (wlog) the output algorithm is also local. 
Finally, we note that for DC the transformation also 
preserves monotonicity. 

In general, the transformation only guarantees that 
its output is a lazy algorithm of no larger cost than A. 
The additional properties rely on particular structure of 
the input algorithm and the matching that is used upon 
requests. 

Lemma 4.3. Given an algorithm A, the transformation 
produces a lazy algorithm A' such that for every sequence 
of requests R — r\,... ,ri, it holds that C {A' ,R) < C{A,R). 

Proof. It follows directly from the transformation that A' 
is lazy. In order to bound its cost, denote the costs of 






Algorithm 4 Double Coverage on a line. 

Input: A request r on the line. 

1. Let sl and SRhe the servers of DC immediately to the 
left of r and right of r respectively. (Note that one of 
them may not exist; in such case ignore it.) 

2. Move both sr and sr towards r by 
mm{d{sL,r),d{sR,r)}. 

3. Serve r with the server that reached it. (Note that 
both Sr and sr could have.) 


A and A' upon serving request r, by C,(A) and C,(A') 
respectively. We will prove that 

(4.11) C,(A')+A<I><Q(A) , 

where <I> = d{SA,SAi) is a non-negative potential function. 

To prove (4. 1 1), it suffices to consider the moves of A 
and A' independently, in this order. Fix M — M{Sa,Sa/)- 

We keep M hxed as A moves its servers. Clearly, 
when A moves a server s by distance d, the cost of M does 
not increase by more than d. Hence, the same holds for 
the perfect matching of minimum cost. Thus <I> increases 
by at most d, and (4. 1 1) holds. 

Once A is done with its moves, we analyze the move 
of A'. Note that at this point r, G Sa, i.e., A has one of 
its servers at r,-. Let M = M{Sa,Sa') be a minimum cost 
matching of the algorithms’ servers and s G Sa' the server 
of A' that is matched to r, . Upon the move of s to r,, O 
decreases by at least d{s,ri) since this is the case if we 
keep M hxed. (It is actually easy to see that there is no 
cheaper matching). Thus, C, (A') + A4> < 0 holds. □ 

Remark 4.1. The transformation produces a lazy local 
algorithm if, upon each request ri, an rt-local matching is 
used. 

Note that we have proved that the algorithm that 
our transformation produces is local but not that it is 
monotone. Nevertheless, for the case of line and the 
Double Coverage (DC) algorithm [4, 5], we can prove 
monotonicity. 

4.4 Transforming DC on a line The Double Coverage 
(DC) algorithm has been originally designed for the line 
metric [4], and later generalized to all tree metrics [5]. For 
these metrics, it attains the optimal competitive ratio of k. 
For completeness, we describe DC for the case of the line 
metric in Algorithm 4. 

To prove that DC on a line can be transformed to 
a lazy local monotone algorithm, we will rely on the 
fact that DC itself is local and monotone, as well as on 
properties of canonical matchings on a line. 


Lemma 4.4. Applied to DC on a line, the transformation 
produces a lazy local monotone algorithm if, upon each 
request r,-, an rt-canonical matching is used. 

Proof. We focus on monotonicity since the other prop¬ 
erties follows from Lemma 4.3 and Remark 4.1. To 
this end, let DC' be the output of the transformation and 
^DC = {si,.. ■ ,Si:}, where s\ < S2 < ... < sr, its current 
state. Since for each request r, we use the r-local canon¬ 
ical matching in the transformation, r will be served by a 
server of DC' that is local for it. In particular, DC' will 
serve a request r <si with and a request r>SR with sr. 
Similarly, DC' will serve a request at position of one its 
servers s with s. 

Now consider Si < r < ij+i, i.e., a request between 
two adjacent servers of DC'. Suppose that DC serves 
r with its y'-th leftmost server. Recall that an r-local 
canonical matching between the state of DC after serving 
r to Sr)c’ would be used in the transformation for a request 
(at) r. In the canonical matching r would be matched to 
Sj but as this matching is made local, r is matched to the 
last server of DC' on the path from Sj to r. In other words, 
r is matched to i, if j < i and to otherwise, and it 
is served by the server it is matched to. Since DC itself is 
local monotone, j is a non-decreasing function of r, which 
implies that DC' is monotone. □ 

4.5 Payment scheme We dehne a payment scheme as 
P: 5a I—K. The disutility of an agent while serving a 
request r using server s is dehned as d{r,s) +P{s). We 
assume that at each iteration the request r is served using 
the server s that minimizes d{r,s) +P{s), where r is the 
request the incoming agent wishes to serve. 

Theorem 4.1. For every lazy local monotone determin¬ 
istic k-server algorithm A on a tree, there exists a payment 
scheme P that makes incoming agents weakly follow A 
and incur movement cost no larger than that of A. 

See the remarks after the proof for explanation what 
“weakly” means. 

Proof. We obtain the desired pricing P by hxing the price 
of an arbitrary server in Sa and setting the remaining 
prices so that they satisfy a system of linear equations. 
The equations follow from the following consideration. 

Let Sa be the state of the A. Since A is lazy local 
monotone, for every server s G Sa, there is subtree con¬ 
taining s consisting of all the points that would be served 
by s if a request were made there. Clearly, these subtrees 
form a partition of the underlying tree. From now on we 
refer to these subtrees as regions that particular servers 
are responsible for to avoid confusion. Since the underly¬ 
ing space is a tree, the adjacency relation on the regions 
dehnes a subtree spanning the regions. 






Let s and s' be a pair of servers whose regions 
are adjacent. Then, there is a threshold point 
(which may be one of s, s') on the path ^^{s,s') such 
that, ignoring v{s,s') itself, the algorithm would serve 
all requests on £P{s,v{s,s')) with s and all requests on 
t^(s',v(s,s')) with s'; a request at v(s,s') would be served 
with either of s, s'. Having identified the threshold vertex 
v(s,s'), we introduce the following equation: 

(4.12) P{s)+d{s,v{s,s')) = P{s')+d{s',v{s,s')) . 

We claim that the system of equation(s) (4.12) taken 
for all pairs of adjacent servers has a unique solution once 
the price for an arbitrary server is fixed. This follows, 
since we introduced one equation per edge of the tree 
of regions; in particular, acyclicity guarantees that the 
equations are independent. In fact, the pricing that is 
the unique solution to our system can be found by a BPS 
traversal of G, starting at the vertex with fixed price, and 
using (4.12) at each newly visited node to determine its 
price. 

To see that the pricing P makes the agents follow the 
algorithm, consider a request at point v that A would serve 
with s. Then for any s' GSa, consider the path from s' to v, 
and let s' = so,si,... ,Si = S be the sequence of servers re¬ 
sponsible for successive regions on it. By (4.12), for each 
j < i, we have P{sj) = P{sj+i) ^ d{s j+i,v{sj,sj+i)) - 
d{sj,v[spSj+\)) > P{sj+\) — d[sjj^\,Sj). Summing the 
inequalities over a prefix of the path, we get that for all 

j<i 

(4.13) P{s')=Piso)>P{sj)-d{so,Sj) . 

Therefore, if the server s = Sj itself lies on the path, 
(4.13) yields that P{s') = P{s') +d{s',s) + 

d{s,v) > P{s) + d{s,v), i.e., that s minimizes the disu¬ 
tility among servers responsible for the regions along 
the path. If s does not lie on this path, i.e., if v 
lies between s' and s, we note that by (4.13) for j = 
i- 1, P(s') = P{s') +d{s',Si^i) +d{si_i,v) > 

-f v). But as the region is responsi¬ 

ble for is adjacent to the one s is responsible for, (4.12) 
guarantees that P{s) + d{s,v) < P{s) + d{s,v{s,si^i)) = 
i)+d{si-i,v{s,si^i)) <P{si-i)+d 1 ,v). Thus s 
minimizes the disutility also in this case, and thus overall. 
□ 

As noted before, we stress that tie-breaking is up to 
the agents, and therefore we cannot guarantee that the 
agents will follow an algorithm exactly. This is what we 
mean by saying in Theorem 4.1 that the agents “weakly 
follow” such algorithm. However, tie-breaking affects 
neither Lemma 4.3 nor Remark 4.1, i.e., even if agents 
break ties differently than the algorithm, in particular, 
even if they move non-local servers, we can still price the 
servers in such a way that the cost of moving servers by 


the agents is no larger than the cost of moving them by the 
algorithm. 

We sketch why this is the case. If for a given request 
r, more than one adjacent server minimizes the disutility 
of serving r, then r is exactly at the threshold vertex of the 
pricing scheme’s design, i.e., the boundary of at least two 
regions. Then it is easy to see that either server can be 
matched to r in a minimum cost matching, i.e., a move 
of either server could result from the transformation, 
depending on the matching that is chosen. To deal with 
non-adjacent servers, it is easiest to assume that for any 
pair of adjacent servers i; and i,+i, the threshold vertex 
is always strictly between them. This can be 
done by moving the vertex by arbitrarily small £ > 0 away 
from the server s it coincides with. Clearly, this may 
result in violating (4.11) only by 2e, and we can make 
£ decrease exponentially for successive requests, so that 
the total additional additive cost is at most c ■ £, where c 
can also be made arbitrarily small. 

Corollary 4.1. There exists a payment scheme for the 
k-server problem on a line which yields a price of anarchy 
ofk. 

5 Pricing Parking Slots on a City Block 
5.1 Problem Definition We consider the following 
strategic parking problem; the underlying problem is 
known as the (online) metric matching but we use the 
parking terminology. 

The “town” is modeled as an undirected graph G = 
iy,E), |y| = m. Vertices of G represent empty parking 
slots, which are the possible destinations of drivers arriv¬ 
ing in town. Every edge e = {u,v) G E has an associated 
weight We, corresponding to the distance between vertices 
V and u. 

As for the cars: 

• A total of n cars are going to park over time (n < m). 
Cars do not depart until every car has parked. 

• Cars arrive online, one at a time, in order of their 
index, 1,2,... ,n. 

• Cars can only park at unoccupied parking spots. 

• Cars behave selfishly and seek to minimize their 
disutility (defined below). 

We denote the parking spot taken by car i by x{i), and 
call the function x:[n\^V that maps cars to parking slots 
an assignment. 

A dynamic pricing scheme P = {Pi,P 2 ,...,Pn), 
where 


: {x(l),x(2),... ,y(! — 1)} X y I—)■ M , 


sets a price for an unoccupied parking space v G V 
given that there are i — 1 occupied parking spaces: 
x{l),...,x{i— 1). The functions P, may use random bits. 
Set Po(v') = 0 for all v G V (or any other constant). When 
we refer to P,(v), we assume that i— 1 cars have already 
arrived and are parked at v(l), ... ,x{i— 1). Thus, 

• Given G and i G and given that cars 

j G 1,... ,i — 1 are already parked at locations 
v(l),.. . ,x{i — 1), Pi { v ) G K gives a price to park at 
V G y \ {.r(l),. .. ,x{i — 1)}. This is a one-time park¬ 
ing fee, not (say) charged by the hour. 

• The current parking fees, P;(v), v G V \ 
{.r(l),... ,.r(/ — 1)} are known to car i before 
parking. 

Incentives for cars are determined as follows: 

• Let g{i), I < i < n the be destination of car i, g is 
called the goal function. 

• Given the graph G, goal function g, and assign¬ 
ment function x, the non-monetary cost to agent 
i G [n] with destination g{i) and an assigned parking 
slot x{i) is d (g(i) ,x(i)), the weight of a minimum 
weight path from x(i) to g(i) in G. 


5.2 Dynamic Harmonic Pricing for the Weighted 
Line Graph In this section we describe the harmonic 
pricing scheme that guaranties an O (min {logP,n^}) ap¬ 
proximation to the minimal cost matching in expectation, 
where R is the aspect ratio of the underlying metric. The 
expectation is over the random bits of the mechanism. 
The coordination mechanisms sets fees to the vacant park¬ 
ing slots. The payments are devised without knowing the 
goals of the drivers. Once set, a car parks so as to mini¬ 
mize its disutility. 

The improved approximation guaranties of this 
mechanism should be compared with the £2(2”) approx¬ 
imation lower bound for the greedy matching on the 
weighted line graph with free parking [11, 14]. On the 
same input, the harmonic pricing scheme incurs a cost of 
n in expectation. 

The pricing is designed to emulate the Harmonic 
Algorithm of Gupta and Lewi [8] for a line metric. This 
simple algorithm works as follows: let di be the distance 
of the arriving car from the nearest free parking slot to 
the left of it, and dr be its distance from the nearest free 
parking slot to the right. The algorithm assigns the car to 
the slot on the left with probability and to the slot 

on the right with the remaining probability. Gupta and 
Lewi proved the following. 


• The disutility of agent i (who chooses to park at 
unoccupied slot x{i)) is d {g{i) ,x{i)) +Pi{x{i)). I.e., 
the disutility is the sum of the non-monetary cost to 
agent i plus the parking fee to park at x{i). 

Therefore, the agent will choose to park at: 

Si = argmin{^f (g (i) ,s) +T; (s)}, 

seSi 

where 5, is the set of unoccupied vertices of G at the time 
agent /’s arrival. 

As the goal is cost minimization, the target function 
/ is the social cost, 

|(t| 

fi'^,s) = Y,d{gii),s), 

!=1 


i.e., the sum of all the agents’ distances from their goals. 

Since the pricing schemes presented in this section 
are randomized, the price of anarchy, dehned in Equation 
(1.2) should be taken in expectation, and is dehned as 


(5.14) 


E(/(aA)) 

max ■" _ 

a.seeq(P.a) /(a,Opt((7)) 


where eq(P,(7) is the set of equilibria assignments of P, 
which depends on the random bits used in P. 

We refer to the pricing scheme P that satishes P, (v) = 
0 for all / G all v( 1),..., v(i — 1) and all v G 

V \ {x(l),... ,x(i — 1)} as the/ree parking, and its price 
of anarchy as the cost of free parking. 


Theorem 5.1. ([8]) Let t/max be the maximal distance 
between two vertices, and let t/niin be the minimal one. 
The competitive ratio of the harmonic algorithm on a line 
metric is O ^log . 

Theorem 5.1 yields an G(logn) approximation algo¬ 
rithm to the min cost matching in the case where quantity 
dmax/dmm. Called the aspect ratio of the metric, can be 
bounded by a polynomial of n. In fact, Gupta and Lewi [8] 
also show how this algorithm can be combined with the 
doubling technique so that effectively the aspect ratio is 
always bounded by a polynomial of n. This relies on con¬ 
tinuous estimation of the optimum cost and modifying the 
metric distances based on that. 

We provide two pricing schemes. The price of anar¬ 
chy of the simpler one matches the competitive ratio guar¬ 
antee of Harmonic as stated in Theorem 5.1. The more so¬ 
phisticated pricing scheme matches the rehned bound of 
G(logn) given an estimate of the (hnal) optimum social 
cost a priori. 

5.2.1 Computing the posted parking fees We now 

show how to set prices to the vacant parking slots that ef¬ 
fectively makes incoming [selhsh] agents park as though 
they were obeying orders given by the Harmonic Algo¬ 
rithm. As the algorithm is randomized, so will be the 
pricing. 

We say that a pricing scheme ensures harmonic be¬ 
havior if, when an agent arrives: 





• She chooses to park at her goal if vacant. 

• Otherwise, she chooses between the two closest 
vacant slots to the left and right of her goal with 
harmonic probabilities. 

A maximal contiguous run of occupied parking 
spaces is called a block. 

Just before car i is set to park, let V' = {vi,..., v^} 
the set of vacant parking spots and let 81 , 82 ,... , 8 ( C V 
the current set of “blocks”. We use L{ 8 j) to denote the 
hrst available vertex to the left of block 8 j, for some 
j G and let R{ 8 j) denote the hrst available 

vertex to the right of 8 j. Let dj = d{L{ 8 j) ,R{ 8 j)), 
we dehne a set of distributions Di,D 2 ,... ,Di, where for 
every j G {1,...,^}, Dj is uniformly distributed on the 
real interval [—dj,dj]. 

Let P; y' i—7> IR+ be a random function that sets prices 
to vacant slots. Previously, this function was denoted Pi, 
but we omit the index i for the remainder of this section. 

We say that P satishes the Hannonic Payment Condi¬ 
tions if: 

1. For any J e {1,..., £}, P {L { 8 j)) -P{R { 8 j)) ~ Dj. 

2. For any u,v G V' s.t. there are no unoccupied 
parking slots between them: —d («,v) <P(u)- 
P{v) < d {u,v). 

Lemma 5.1. Any pricing function P that satisfies the 
Harmonic Payment Conditions, ensures hannonic be¬ 
haviour on the part of arriving cars, regardless of their 
goals. 

Proof. First, notice that the second condition of the har¬ 
monic payment conditions implies that for any u,v G V', 
-d{u,v) < P{u)-P{v) < d {u,v), even if there are va¬ 
cant slots between them. This implies that if an agent’s 
goal, g, is vacant, then for all v G V' \ {g}, we have that 
P{g) < P{v) -\-d (g,v). Ergo, if the agent’s goal is vacant, 
she parks there. 

Now consider an agent whose goal v is blocked, i.e., 
V G 8 j for some j G Let vl denote L { 8 j) and 

vr denote R { 8 j). For every vacant slot v' G V' left of vr 
wehavec/(v,vz.)-fP(vz,) <d{v,vi)^d{vu^)^P{f) = 
d{v,v') -GP{v'). Symmetrically, for every vacant slot v' 
right of vr, we have that d{v,VR) -GP{vr) < d{v,v') -f 
P(v'). Therefore, the agent has only to choose between 
vl and vr. 

Let Fj be the density function of distribution Dj. 


Notice that Fj (x) = . We have that: 

Pr[The agent takes v/,] 

= Pr [d (v, Vl) G- P {vl) < d {v,vr) + P (v^ )] 

= Pr [P (vl) - P (vr) < d {v, Vr) - d (v, vl)] 

= Pt[P{vl)-P{vr) <dj-2d (v, vl)] 

= Fj{dj-2d{v,VL)) 

_ 2dj - 2d {v, Vl) 

~ 

_ d{v,VR) 

dj ’ 

where the forth equality is due to the first harmonic 
payment condition. Therefore, we get that the agent 
follows the harmonic behavior. □ 

Next, we show a specific pricing scheme that satisfies 
the harmonic payment conditions. 

Observation 5.1. Let R{v) be the set {j G : 

V is left of8j} and let q\ ~ L>i ,g2 ~ L) 2 ,... ,qj'^ Dj. For 
any constant c, setting P(v) = c-f- Y,jeR{v) Qj satisfies the 
Harmonic Payment Conditions. 

We can use the constant c in order to obtain a pricing 
scheme that satishes the harmonic payment conditions in 
which there are no negative transfers. One can ask for 
a pricing scheme that optimizes an additional objective: 
minimizes the sum of prices of all vacant parking slots, 
the maximal price of a slot, the difference in pricing of 
the slots over time, etc. We can use an LP to satisfy such 
objectives. For example. Linear Program 1 minimizes 
the sum of prices of slots given qj ~ Dj as prescribed in 
Observation 5.1. Note that Linear Program 1 is feasible 
since the payment scheme of Observation 5.1 satishes all 
its constraints. 

Since we have shown there exists a pricing scheme 
which makes incoming agents simulate the behavior of 
the harmonic algorithm, we can use this pricing scheme 
to coordinate incoming agents, and improve the PoA 
signihcantly. 

In graphs where the aspect ratio is bounded by some 
polynomial in n, we get: 

Corollary 5.1. Any pricing scheme which satisfies the 
harmonic payment conditions gives an O (logn) approxi¬ 
mation to the optimal assignment when the aspect ratio is 
bounded by poly(n). 

For general weighted line graphs, without further assump¬ 
tions, we get the following: 

Theorem 5.2. Any pricing scheme which satis¬ 
fies the harmonic payment conditions gives an 
0 ^minjlog approximation to the optimal 

assignment in weighted line graphs in expectation. 




Linear Program 1 A linear program for pricing that satis¬ 
fies the harmonic payment conditions and also minimizes 
the sum of parking fees over all parked cars. Note that this 
program actually minimizes the sum of the posted prices 
of vacant parking spaces, but — any feasible solution to 
this LP results in the same harmonic behavior so, by re¬ 
peatedly computing such pricing, we minimize the sum of 
payments of all parked cars. 
minLvGV'Pv s.t. 

• Vy S { 1,..., PL(Bj) = PR{Bj) + 9j- 

• Vu,v G V' s.t. there is no vacant slot between them, 
—d (m, v) < pu— pv < d (m, v). 

• Vv G y', Pv > 0. 


Proof. The proof that the harmonic algorithm is 
O (log J competitive is given in [8]. For proving that 

the competitive ratio is also bounded by O^rP'), we use the 
same notation as in [8]^. 

Replacing the final bound on 'E[5,nax\8{\ in the proof 
of Lemma 4 [8] by the following one gives us the desired 
result. 

Lemma 5.2. E[5ma.r|5i] <rP5\. 

Proof. First note that 5,„a.x is the distance between two 
servers on the line, therefore there are at most rP possible 
values for this distance. Let J be the set of possible 
distances. We get that 

^7c[Sma.x\Sl] = ^kPr[5max = ^|5l] < ^ kPr[5ma;i-> k|5i] 

keJ keJ 

= E^e5,.<E4 = E5i = |y|5i 

keJ keJ ^ keJ 

<rP5\ , 

where 5; x Lemma 4(ii) of [8]. □ □ 

5.2.2 Pricing with a Prior In this section we show 
how to improve the expected PoA via a dynamic pricing 
scheme when the city planner has an estimate of the cost 
of the optimal matching. This seems to be a reasonable 
assumption in many scenarios, since the city planner only 
needs to approximate the optimal matching to within 
a poly(n) factor in order to devise a pricing scheme 
that achieves an (9(logn) approximation. This is an 
improvement over the harmonic pricing scheme whenever 
the aspect ratio is superpolynomial in n. 

full version is available at http://theory.stanford.edu/ 
-klewi/papers/metric-matching-full.pdf but its analysis of 
Harmonic is slightly different. 


Gupta and Lewi show that given Z G [OPT, c ■ OPT] 
where c > 1, one can get an O {logcn) approximation 
using the following metric change; 

• Disconnect every edge with weight > Z. 

• Change every edge of weight smaller than Z/ (2c • n^) 
to be of weight exactly Z/ilc -n^). 

Let d' denote the new metric after the transformation. 
First, note that the aspect ratio in d' is 2c • since the 
maximal distance is at most Zn, while the minimal one 
is at least Zjiflc-rP'). Another key observation is that 
since the optimal matching doesn’t use any disconnected 
edges and the weight of each edge increased by at most 
Z/(2c-n^), the cost of every matched server-request in 
the optimal matching increased by at most Z/(2c-n). 
Therefore, if OPT denotes both the optimal matching in 
{y,d) and its cost in {V,d), then the cost of OPT in {V,d') 
is at most OPT + Zl2c < L50PT. As the weights in d 
are no larger than in d', running the harmonic algorithm 
in iy,d') yields an 0{logcn) approximation to OPT in 
iV,d). 

Thus, given Z, an estimate of the cost of the optimal 
matching, a dynamic pricing scheme needs to simulate 
the change in the metric. Specifically, the pricing scheme 
needs to: 

1. Make sure agents don’t “jump over” edges of weight 
bigger than Z. 

2. Make incoming agents simulate the behavior of the 
harmonic algorithm according to the transformed 
metric by acting selfishly. 

We say that an online matching algorithm is mono¬ 
tone if the following conditions hold: 

1. If an incoming request is at a position of an un¬ 
matched server, the server will be matched to that 
request. 

2. Let Bj be some block of matched servers, let = 
L{Bj) and vr = R{Bj). Consider any two points 
a,b G Bj and a request to be made at either of them. 
Then 

d {a,VL) < d {b^VL) 

Pr [request at a would be matched to vl\ 

> Pr [request at b would be matched to vp . 

It is easy to see that running the harmonic algorithm on 
both the original and the transformed metrics yields a 
monotone algorithm. The following theorem shows that 
every monotone algorithm can be priced: 






Theorem 5.3. Every monotone online algorithm for the 
min cost matching problem on a line can be transformed 
into a dynamic pricing scheme which makes incoming 
agents simulate the algorithm. 


Proof Let A be a monotone algorithm for the online min 
cost matching problem. Let 81 , 82 , ■■■ ,Be C V be the 
blocks that represent the current state of the graph, and 
let y' = y \ Uj=i Bj the set of vacant parking slots. Let 
vl = L{8j) and vr = R {8j). Let : 8j (0,1] be a 
probability function that maps every vertex in the block 
into the probability of the vertex being matched by vr in 
A. Let y,-= C 8 j the set of vertices for 

which p^^ (^;) > Pl > ■ ■ ■> pi ■ We define a 
distribution Dj which determines the difference between 
P{vr) and P{vl). For every i G {l,...,r}, let A, = 

Dj is defined via the following cumulative density 
function Fj : 

f 0 A < -dj 

F,(A)=i l-pi(v') A,>A>A,_i 
[1 A > dj, 

where Aq = —dj and A,+i = dj. From A’s monotonicity, it 
is clear that —dj < Ai < A 2 <...< At < dj, and therefore 
Fj is a valid density function. 

We claim that a dynamic pricing scheme P : V' i-G 
IR+ that satisfies the following two conditions makes 
incoming agents simulate algorithm A by acting selfishly; 

1. For any ;• e {1 F (/? (By)) -P{L(Bj)) ~ Dj. 

2. For any u,v G V' s.t. there are no unoccupied slots 

between them: —d{u,v) <P{u)—P{v) <d{u,v). 

Just as in Lemma 5.1, the second condition yields 
that if an agent’s goal vertex is vacant, she will occupy 
it. Let V G Bj be the incoming agent’s goal for some 
j G and let v) be the first vertex in Vj (the one 

with minimal index i) such that v is not right of it. We have 

that Pr [A matches v to vi\ = pi {v'^. The probability that 
the selfish agent will occupy vl is: 


Pr [v occupies vi] 


Px[dL+P{vL) < Bfl+F(v«)] 
Pr [P {vr)-P {vl) >dL- dR] 
Px[P{vR)-P{vL)>Ai\ 
\-Px\P{vr)-P{vl)<A,] 
\-F{At) 

pM). 


where the third equality stems from the fact that A, > 
dL — dR> A,_i. 


Algorithm 5 A dynamic pricing scheme that makes self¬ 
ish agents simulate algorithm A. 

• Letqir^Di,g2-^D2,...,qe^D(. 

• Let L{v) be the set {y G {1,... ,f} : v is right of By}. 

• Vv G y', set P{v) = c + Y^jeLivi^lj for some constant 
c. 


The dynamic pricing scheme given in Algorithm 5 
satisfies the conditions for simulating algorithm A. □ 

We conclude: 

Corollary 5.2. Given an estimate of the cost of the 
optimal matching by a poly(n) factor, there exists a 
payments scheme that gives an 0{\ogn) approximation 
to the optimal assignment in weighted line graphs in 
expectation. 

Proof. Let Z be an estimate, and p be some polyno¬ 
mial such that OPT/p(n) < Z < p{n) OPT. First, ap¬ 
ply the metric change using Z' = Z • p{n) as input. Af¬ 
terwards, use a dynamic pricing scheme that simulates 
the harmonic algorithm according to the transformed met¬ 
ric. There exists such a pricing scheme according to The¬ 
orem 5.3. Since Z' G [OPT,p(n)^OPT], this yields an 
O (log {p{n)^ • n)) = O (logn) approximation to the opti¬ 
mal assignment in expectation. □ 

6 Discussion 

Observable History For some arbitrary function h, we 
define the fi-observable history after event i — I to be 
h{a,s) where a = o’],..., (7,_ 1 is the sequence of previous 
agent types and s = si,..., 1 the sequence of decisions 

made thus far. 

If for some non-invertible function q we have that 

g(a,s) = q{g'{a,s)), Va,s, 

we say that g hides more than g'. 

This suggests a tradeoff between the price of anarchy 
of a dynamic posted pricing scheme and the information 
available to it. 

We remark that the study of fi-observable histories 
also makes sense for arbitrary online problems, in the con¬ 
text of mechanism design (not restricted to posted prices) 
and in the non-strategic setting of online algorithms. 

Pricing With posted prices, prices are determined before 
the event occurs. 

For non-posted price mechanisms — one issue is 
“when will the price be determined?” Is the payment 
determined promptly (when the agent arrives) or only 
later? In [7] this issue came up in the context of an online 
Groves mechanism. 





Commitment to pricing, Rate of Price Change Obvi¬ 
ous disadvantages of dynamic pricing include (a) prices 
are unknown, you plan to go downtown but don’t know 
what it will cost you, (b) you may start circling the block 
waiting for the prices to change. Thus it would be highly 
advantageous to be able to (a) commit to prices in advance 
and (b) slow the rate of change. 

One interesting model is to have the price function 
Pi depend, not on the events ffi,... and decisions 
but on a shorter prefix of these sequences. 
This has the effect of committing to prices in advance. 
A natural question is how does this impact the price of 
anarchy? 

Simulating Simple Online Algorithms It is not clear if 
or how one can convert an arbitrary online algorithm to a 
dynamic pricing scheme. The simulation technique that 
we’ve repeatedly used above seems to work when the 
algorithm is particularly simple. For example, it is not 
clear how to give dynamic posted prices with good price 
of anarchy for the problem of metrical matching on metric 
spaces other than the line. One desirable goal would be 
to find alternative online algorithms, possibly with higher 
competitive ratios, and try to find pricing schemes that 
simulate these algorithms. 

Structural Relationships Given an online setting, it is 
clear that the following weakly increase: 

• The competitive ratio of an online algorithm. 

• The price of anarchy achievable by a dominant strat¬ 
egy incentive compatible mechanism, where the true 
agent types are revealed. 

• The price of anarchy achievable by a dominant 
strategy incentive compatible mechanism, when re¬ 
stricted to the current agent type and some h- 
observable history of previous agents. 

• The price of anarchy achievable by a dynamic pric¬ 
ing scheme restricted to /z-observable history. 

However, it is not clear if there is any stronger relation that 
holds in general. Moreover, these can be further refined, 
leading to additional questions, e.g.: 

• How does promptness of payment determination im¬ 
pact the price of anarchy of mechanisms? 

• Given that a mechanism or pricing schemes uses h- 
observable history, how does the price of anarchy 
depend on properties of hi 
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A Appendix 

A.l An Illustration of the Fractional Traversal Algo¬ 
rithm The following is an example illustrating the exe¬ 
cution of Algorithm 1 given three tasks as input; 


Example 1 The fractional traversal algorithm applied 
to a metrical task system with states {1,2,3}, distances 
r/i 2 = 2, c/i 3 = 3, r/ 2,3 = 4, and traversal sequence T = 
(1,2,1,3)*. The example and the underlying metric are 
also illustrated._ 

• Task = (3,6,3): 

- A/ = 2/3, i.e., process two thirds of task in 
state Ti = 1, for a work expenditure of 3 • 2/3 = 
2. Increase j by one. 

- A} = 1/3, i.e., process the remaining 1/3 of 
task w' in state T 2 = 2 , for a work expenditure 
of 6/3 = 2. At this point the algorithm will 
increase j by one. Move to state T 3 = 1. 

• Task = (1,3,4): A 3 = 1, for a work expenditure 
of 1 in state T 3 = 1 , at this point of time pj = p 3 = 1. 

• Taskw^ = (10,10,10); 

- A 3 = 2 / 10 , for a work expenditure of 2 in state 
T 3 = 2. Increase j by one. Move to state T 4 = 3. 

- A 4 = 3/10, for a work expenditure of 3 in state 
T 4 = 3. Increase j by one. Move to state T 5 = 1. 

- Aj = 2 / 10 , for a work expenditure of 2 in state 
T 5 = 1. Increase j by one. Move to state Te = 2. 

- Ag = 2 / 10 , for a work expenditure of 2 in state 
Tg = 2. Increase j by one. Move to state T 7 = 1. 

- A/ = 1 /lO, for a work expenditure of 1 in state 
T 7 = 1 , at this point of time pj — 1. 
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